Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 22.319
Filtrar
1.
Cogn Sci ; 48(4): e13438, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38605457

RESUMO

Numerous studies have found that selective attention affects category learning. However, previous research did not distinguish between the contribution of focusing and filtering components of selective attention. This study addresses this issue by examining how components of selective attention affect category representation. Participants first learned a rule-plus-similarity category structure, and then were presented with category priming followed by categorization and recognition tests. Additionally, to evaluate the involvement of focusing and filtering, we fit models with different attentional mechanisms to the data. In Experiment 1, participants received rule-based category training, with specific emphasis on a single deterministic feature (D feature). Experiment 2 added a recognition test to examine participants' memory for features. Both experiments indicated that participants categorized items based solely on the D feature, showed greater memory for the D feature, were primed exclusively by the D feature without interference from probabilistic features (P features), and were better fit by models with focusing and at least one type of filtering mechanism. The results indicated that selective attention distorted category representation by highlighting the D feature and attenuating P features. To examine whether the distorted representation was specific to rule-based training, Experiment 3 introduced training, emphasizing all features. Under such training, participants were no longer primed by the D feature, they remembered all features well, and they were better fit by the model assuming only focusing but no filtering process. The results coupled with modeling provide novel evidence that while both focusing and filtering contribute to category representation, filtering can also result in representational distortion.


Assuntos
Atenção , Aprendizagem , Humanos , Rememoração Mental , Reconhecimento Psicológico , Formação de Conceito
2.
Sci Rep ; 14(1): 8660, 2024 04 15.
Artigo em Inglês | MEDLINE | ID: mdl-38622177

RESUMO

Agriculture plays a pivotal role in the economic development of a nation, but, growth of agriculture is affected badly by the many factors one such is plant diseases. Early stage prediction of these disease is crucial role for global health and even for game changers the farmer's life. Recently, adoption of modern technologies, such as the Internet of Things (IoT) and deep learning concepts has given the brighter light of inventing the intelligent machines to predict the plant diseases before it is deep-rooted in the farmlands. But, precise prediction of plant diseases is a complex job due to the presence of noise, changes in the intensities, similar resemblance between healthy and diseased plants and finally dimension of plant leaves. To tackle this problem, high-accurate and intelligently tuned deep learning algorithms are mandatorily needed. In this research article, novel ensemble of Swin transformers and residual convolutional networks are proposed. Swin transformers (ST) are hierarchical structures with linearly scalable computing complexity that offer performance and flexibility at various scales. In order to extract the best deep key-point features, the Swin transformers and residual networks has been combined, followed by Feed forward networks for better prediction. Extended experimentation is conducted using Plant Village Kaggle datasets, and performance metrics, including accuracy, precision, recall, specificity, and F1-rating, are evaluated and analysed. Existing structure along with FCN-8s, CED-Net, SegNet, DeepLabv3, Dense nets, and Central nets are used to demonstrate the superiority of the suggested version. The experimental results show that in terms of accuracy, precision, recall, and F1-rating, the introduced version shown better performances than the other state-of-art hybrid learning models.


Assuntos
Rememoração Mental , Reconhecimento Psicológico , Agricultura , Algoritmos , Doenças das Plantas
3.
Nat Commun ; 15(1): 3081, 2024 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-38594279

RESUMO

Tactile sensation and vision are often both utilized for the exploration of objects that are within reach though it is not known whether or how these two distinct sensory systems combine such information. Here in mice, we used a combination of stereo photogrammetry for 3D reconstruction of the whisker array, brain-wide anatomical tracing and functional connectivity analysis to explore the possibility of tacto-visual convergence in sensory space and within the circuitry of the primary visual cortex (VISp). Strikingly, we find that stimulation of the contralateral whisker array suppresses visually evoked activity in a tacto-visual sub-region of VISp whose visual space representation closely overlaps with the whisker search space. This suppression is mediated by local fast-spiking interneurons that receive a direct cortico-cortical input predominantly from layer 6 neurons located in the posterior primary somatosensory barrel cortex (SSp-bfd). These data demonstrate functional convergence within and between two primary sensory cortical areas for multisensory object detection and recognition.


Assuntos
Neurônios , Tato , Camundongos , Animais , Neurônios/fisiologia , Tato/fisiologia , Interneurônios , Reconhecimento Psicológico , Córtex Somatossensorial/fisiologia , Vibrissas/fisiologia
4.
Sensors (Basel) ; 24(7)2024 Mar 28.
Artigo em Inglês | MEDLINE | ID: mdl-38610396

RESUMO

The increasing popularity of pigs has prompted farmers to increase pig production to meet the growing demand. However, while the number of pigs is increasing, that of farm workers has been declining, making it challenging to perform various farm tasks, the most important among them being managing the pigs' health and welfare. This study proposes a pattern mining-based pig behavior analysis system to provide visualized information and behavioral patterns, assisting farmers in effectively monitoring and assessing pigs' health and welfare. The system consists of four modules: (1) data acquisition module for collecting pigs video; (2) detection and tracking module for localizing and uniquely identifying pigs, using tracking information to crop pig images; (3) pig behavior recognition module for recognizing pig behaviors from sequences of cropped images; and (4) pig behavior analysis module for providing visualized information and behavioral patterns to effectively help farmers understand and manage pigs. In the second module, we utilize ByteTrack, which comprises YOLOx as the detector and the BYTE algorithm as the tracker, while MnasNet and LSTM serve as appearance features and temporal information extractors in the third module. The experimental results show that the system achieved a multi-object tracking accuracy of 0.971 for tracking and an F1 score of 0.931 for behavior recognition, while also highlighting the effectiveness of visualization and pattern mining in helping farmers comprehend and manage pigs' health and welfare.


Assuntos
Algoritmos , Reconhecimento Psicológico , Suínos , Animais , Fazendas , Análise de Sistemas
5.
Sensors (Basel) ; 24(7)2024 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-38610405

RESUMO

With the increase in the scale of breeding at modern pastures, the management of dairy cows has become much more challenging, and individual recognition is the key to the implementation of precision farming. Based on the need for low-cost and accurate herd management and for non-stressful and non-invasive individual recognition, we propose a vision-based automatic recognition method for dairy cow ear tags. Firstly, for the detection of cow ear tags, the lightweight Small-YOLOV5s is proposed, and then a differentiable binarization network (DBNet) combined with a convolutional recurrent neural network (CRNN) is used to achieve the recognition of the numbers on ear tags. The experimental results demonstrated notable improvements: Compared to those of YOLOV5s, Small-YOLOV5s enhanced recall by 1.5%, increased the mean average precision by 0.9%, reduced the number of model parameters by 5,447,802, and enhanced the average prediction speed for a single image by 0.5 ms. The final accuracy of the ear tag number recognition was an impressive 92.1%. Moreover, this study introduces two standardized experimental datasets specifically designed for the ear tag detection and recognition of dairy cows. These datasets will be made freely available to researchers in the global dairy cattle community with the intention of fostering intelligent advancements in the breeding industry.


Assuntos
Agricultura , Reconhecimento Psicológico , Animais , Feminino , Bovinos , Fazendas , Indústrias , Inteligência
6.
Sensors (Basel) ; 24(7)2024 Apr 02.
Artigo em Inglês | MEDLINE | ID: mdl-38610479

RESUMO

In recent years, the advancement of generative techniques, particularly generative adversarial networks (GANs), has opened new possibilities for generating synthetic biometric data from different modalities, including-among others-images of irises, fingerprints, or faces in different representations. This study presents the process of generating synthetic images of human irises, using the recent StyleGAN3 model. The novelty presented in this work consists in producing generated content in both Cartesian and polar coordinate representations, typically used in iris recognition pipelines, such as the foundational work proposed by John Daugman, but hitherto not used in generative AI experiments. The main objective of this study was to conduct a qualitative analysis of the synthetic samples and evaluate the iris texture density and suitability for meaningful feature extraction. During this study, a total of 1327 unique irises were generated, and experimental results carried out using the well-known OSIRIS open-source iris recognition software and the equivalent software, wordlcoin-openiris, newly published at the end of 2023 to prove that (1) no "identity leak" from the training set was observed, and (2) the generated irises had enough unique textural information to be successfully differentiated between both themselves and between them and real, authentic iris samples. The results of our research demonstrate the promising potential of synthetic iris data generation as a valuable tool for augmenting training datasets and improving the overall performance of iris recognition systems. By exploring the synthetic data in both Cartesian and polar representations, we aim to understand the benefits and limitations of each approach and their implications for biometric applications. The findings suggest that synthetic iris data can significantly contribute to the advancement of iris recognition technology, enhancing its accuracy and robustness in real-world scenarios by greatly augmenting the possibilities to gather large and diversified training datasets.


Assuntos
Biometria , Iris , Humanos , Reconhecimento Psicológico , Software , Tecnologia
7.
Sensors (Basel) ; 24(7)2024 Apr 05.
Artigo em Inglês | MEDLINE | ID: mdl-38610534

RESUMO

This study explores the important role of assessing force levels in accurately controlling upper limb movements in human-computer interfaces. It uses a new method that combines entropy to improve the recognition of force levels. This research aims to differentiate between different levels of isometric contraction forces using electroencephalogram (EEG) signal analysis. It integrates eight different entropy measures: power spectrum entropy (PSE), singular spectrum entropy (SSE), logarithmic energy entropy (LEE), approximation entropy (AE), sample entropy (SE), fuzzy entropy (FE), alignment entropy (PE), and envelope entropy (EE). The findings emphasize two important advances: first, including a wide range of entropy features significantly improves classification efficiency; second, the fusion entropy method shows exceptional accuracy in classifying isometric contraction forces. It achieves an accuracy rate of 91.73% in distinguishing between 15% and 60% maximum voluntary contraction (MVC) forces, along with 69.59% accuracy in identifying variations across 15%, 30%, 45%, and 60% MVC. These results illuminate the efficacy of employing fusion entropy in EEG signal analysis for isometric contraction detection, heralding new opportunities for advancing motor control and facilitating fine motor movements through sophisticated human-computer interface technologies.


Assuntos
Eletroencefalografia , Contração Isométrica , Humanos , Entropia , Movimento , Reconhecimento Psicológico
8.
Sci Rep ; 14(1): 7760, 2024 04 02.
Artigo em Inglês | MEDLINE | ID: mdl-38565934

RESUMO

Disrupted or atypical light-dark cycles disrupts synchronization of endogenous circadian clocks to the external environment; extensive circadian rhythm desynchrony promotes adverse health outcomes. Previous studies suggest that disrupted circadian rhythms promote neuroinflammation and neuronal damage post-ischemia in otherwise healthy mice, however, few studies to date have evaluated these health risks with aging. Because most strokes occur in aged individuals, we sought to identify whether, in addition to being a risk factor for poor ischemic outcome, circadian rhythm disruption can increase risk for vascular cognitive impairment and dementia (VCID). We hypothesized that repeated 6 h phase advances (chronic jet lag; CJL) for 8 weeks alters cerebrovascular architecture leading to increased cognitive impairments in aged mice. Female CJL mice displayed impaired spatial processing during a spontaneous alternation task and reduced acquisition during auditory-cued associative learning. Male CJL mice displayed impaired retention of the auditory-cued associative learning task 24 h following acquisition. CJL increased vascular tortuosity in the isocortex, associated with increased risk for vascular disease. These results demonstrate that CJL increased sex-specific cognitive impairments coinciding with structural changes to vasculature in the brain. We highlight that CJL may accelerate aged-related functional decline and could be a crucial target against disease progression.


Assuntos
Ritmo Circadiano , Demência Vascular , Animais , Camundongos , Masculino , Feminino , Ritmo Circadiano/fisiologia , Fotoperíodo , Reconhecimento Psicológico , Demência Vascular/etiologia , Cognição
9.
JASA Express Lett ; 4(4)2024 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-38573045

RESUMO

The present study examined English vowel recognition in multi-talker babbles (MTBs) in 20 normal-hearing, native-English-speaking adult listeners. Twelve vowels, embedded in the h-V-d structure, were presented in MTBs consisting of 1, 2, 4, 6, 8, 10, and 12 talkers (numbers of talkers [N]) and a speech-shaped noise at signal-to-noise ratios of -12, -6, and 0 dB. Results showed that vowel recognition performance was a non-monotonic function of N when signal-to-noise ratios were less favorable. The masking effects of MTBs on vowel recognition were most similar to consonant recognition but less so to word and sentence recognition reported in previous studies.


Assuntos
Idioma , Fala , Adulto , Humanos , Reconhecimento Psicológico , Razão Sinal-Ruído
10.
Cereb Cortex ; 34(4)2024 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-38602743

RESUMO

The gyrus, a pivotal cortical folding pattern, is essential for integrating brain structure-function. This study focuses on 2-Hinge and 3-Hinge folds, characterized by the gyral convergence from various directions. Existing voxel-level studies may not adequately capture the precise spatial relationships within cortical folding patterns, especially when relying solely on local cortical characteristics due to their variable shapes and homogeneous frequency-specific features. To overcome these challenges, we introduced a novel model that combines spatial distribution, morphological structure, and functional magnetic resonance imaging data. We utilized spatio-morphological residual representations to enhance and extract subtle variations in cortical spatial distribution and morphological structure during blood oxygenation, integrating these with functional magnetic resonance imaging embeddings using self-attention for spatio-morphological-temporal representations. Testing these representations for identifying cortical folding patterns, including sulci, gyri, 2-Hinge, and 2-Hinge folds, and evaluating the impact of phenotypic data (e.g. stimulus) on recognition, our experimental results demonstrate the model's superior performance, revealing significant differences in cortical folding patterns under various stimulus. These differences are also evident in the characteristics of sulci and gyri folds between genders, with 3-Hinge showing more variations. Our findings indicate that our representations of cortical folding patterns could serve as biomarkers for understanding brain structure-function correlations.


Assuntos
Reconhecimento Psicológico , Feminino , Masculino , Humanos , Membrana Celular
11.
Cogn Res Princ Implic ; 9(1): 21, 2024 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-38598036

RESUMO

The use of partially-automated systems require drivers to supervise the system functioning and resume manual control whenever necessary. Yet literature on vehicle automation show that drivers may spend more time looking away from the road when the partially-automated system is operational. In this study we answer the question of whether this pattern is a manifestation of inattentional blindness or, more dangerously, it is also accompanied by a greater attentional processing of the driving scene. Participants drove a simulated vehicle in manual or partially-automated mode. Fixations were recorded by means of a head-mounted eye-tracker. A surprise two-alternative forced-choice recognition task was administered at the end of the data collection whereby participants were quizzed on the presence of roadside billboards that they encountered during the two drives. Data showed that participants were more likely to fixate and recognize billboards when the automated system was operational. Furthermore, whereas fixations toward billboards decreased toward the end of the automated drive, the performance in the recognition task did not suffer. Based on these findings, we hypothesize that the use of the partially-automated driving system may result in an increase in attention allocation toward peripheral objects in the road scene which is detrimental to the drivers' ability to supervise the automated system and resume manual control of the vehicle.


Assuntos
Cegueira , Transtornos Mentais , Humanos , Automação , Coleta de Dados , Reconhecimento Psicológico
12.
J Neural Eng ; 21(2)2024 Apr 11.
Artigo em Inglês | MEDLINE | ID: mdl-38565099

RESUMO

Objective.The study of emotion recognition through electroencephalography (EEG) has garnered significant attention recently. Integrating EEG with other peripheral physiological signals may greatly enhance performance in emotion recognition. Nonetheless, existing approaches still suffer from two predominant challenges: modality heterogeneity, stemming from the diverse mechanisms across modalities, and fusion credibility, which arises when one or multiple modalities fail to provide highly credible signals.Approach.In this paper, we introduce a novel multimodal physiological signal fusion model that incorporates both intra-inter modality reconstruction and sequential pattern consistency, thereby ensuring a computable and credible EEG-based multimodal emotion recognition. For the modality heterogeneity issue, we first implement a local self-attention transformer to obtain intra-modal features for each respective modality. Subsequently, we devise a pairwise cross-attention transformer to reveal the inter-modal correlations among different modalities, thereby rendering different modalities compatible and diminishing the heterogeneity concern. For the fusion credibility issue, we introduce the concept of sequential pattern consistency to measure whether different modalities evolve in a consistent way. Specifically, we propose to measure the varying trends of different modalities, and compute the inter-modality consistency scores to ascertain fusion credibility.Main results.We conduct extensive experiments on two benchmarked datasets (DEAP and MAHNOB-HCI) with the subject-dependent paradigm. For the DEAP dataset, our method improves the accuracy by 4.58%, and the F1 score by 0.63%, compared to the state-of-the-art baseline. Similarly, for the MAHNOB-HCI dataset, our method improves the accuracy by 3.97%, and the F1 score by 4.21%. In addition, we gain much insight into the proposed framework through significance test, ablation experiments, confusion matrices and hyperparameter analysis. Consequently, we demonstrate the effectiveness of the proposed credibility modelling through statistical analysis and carefully designed experiments.Significance.All experimental results demonstrate the effectiveness of our proposed architecture and indicate that credibility modelling is essential for multimodal emotion recognition.


Assuntos
Benchmarking , Emoções , Fontes de Energia Elétrica , Eletroencefalografia , Reconhecimento Psicológico
13.
PLoS One ; 19(4): e0300440, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38598505

RESUMO

The automatic detection of the degree of surface corrosion on metal structures is of significant importance for assessing structural damage and safety. To effectively identify the corrosion status on the surface of coastal metal facilities, this study proposed a CBG-YOLOv5s model for metal surface corrosion detection, based on the YOLOv5s model. Firstly, we integrated the Convolutional Block Attention Module (CBAM) into the C3 module and developed the C3CBAM module. This module effectively enhanced the channel and spatial attention capabilities of the feature map, thereby improving the feature representation. Second, we introduced a multi-scale feature fusion concept in the feature fusion part of the model and added a small target detection layer to improve small target detection. Finally, we designed a lighter C3Ghost module, which reduced the number of parameters and the computational load of the model, thereby improving the running speed of the model. In addition, to verify the effectiveness of our method, we constructed a dataset containing 6000 typical images of metal surface corrosion and conducted extensive experiments on this dataset. The results showed that compared to the YOLOv5s model and several other commonly used object detection models, our method achieved superior performance in terms of detection accuracy and speed.


Assuntos
Utensílios Domésticos , Reconhecimento Psicológico , Corrosão , Metais
14.
Neural Netw ; 174: 106235, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38564978

RESUMO

Recently, Vision Transformer (ViT) has achieved promising performance in image recognition and gradually serves as a powerful backbone in various vision tasks. To satisfy the sequential input of Transformer, the tail of ViT first splits each image into a sequence of visual tokens with a fixed length. Then, the following self-attention layers construct the global relationship between tokens to produce useful representation for the downstream tasks. Empirically, representing the image with more tokens leads to better performance, yet the quadratic computational complexity of self-attention layer to the number of tokens could seriously influence the efficiency of ViT's inference. For computational reduction, a few pruning methods progressively prune uninformative tokens in the Transformer encoder, while leaving the number of tokens before the Transformer untouched. In fact, fewer tokens as the input for the Transformer encoder can directly reduce the following computational cost. In this spirit, we propose a Multi-Tailed Vision Transformer (MT-ViT) in the paper. MT-ViT adopts multiple tails to produce visual sequences of different lengths for the following Transformer encoder. A tail predictor is introduced to decide which tail is the most efficient for the image to produce accurate prediction. Both modules are optimized in an end-to-end fashion, with the Gumbel-Softmax trick. Experiments on ImageNet-1K demonstrate that MT-ViT can achieve a significant reduction on FLOPs with no degradation of the accuracy and outperform compared methods in both accuracy and FLOPs.


Assuntos
Reconhecimento Psicológico
15.
Behav Brain Funct ; 20(1): 8, 2024 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-38637870

RESUMO

One important role of the TPJ is the contribution to perception of the global gist in hierarchically organized stimuli where individual elements create a global visual percept. However, the link between clinical findings in simultanagnosia and neuroimaging in healthy subjects is missing for real-world global stimuli, like visual scenes. It is well-known that hierarchical, global stimuli activate TPJ regions and that simultanagnosia patients show deficits during the recognition of hierarchical stimuli and real-world visual scenes. However, the role of the TPJ in real-world scene processing is entirely unexplored. In the present study, we first localized TPJ regions significantly responding to the global gist of hierarchical stimuli and then investigated the responses to visual scenes, as well as single objects and faces as control stimuli. All three stimulus classes evoked significantly positive univariate responses in the previously localized TPJ regions. In a multivariate analysis, we were able to demonstrate that voxel patterns of the TPJ were classified significantly above chance level for all three stimulus classes. These results demonstrate a significant involvement of the TPJ in processing of complex visual stimuli that is not restricted to visual scenes and that the TPJ is sensitive to different classes of visual stimuli with a specific signature of neuronal activations.


Assuntos
Imageamento por Ressonância Magnética , Lobo Parietal , Humanos , Lobo Parietal/fisiologia , Reconhecimento Psicológico , Neuroimagem , Análise Multivariada , Estimulação Luminosa , Reconhecimento Visual de Modelos/fisiologia , Percepção Visual/fisiologia , Mapeamento Encefálico/métodos
16.
Artigo em Inglês | MEDLINE | ID: mdl-38598402

RESUMO

Canonical correlation analysis (CCA), Multivariate synchronization index (MSI), and their extended methods have been widely used for target recognition in Brain-computer interfaces (BCIs) based on Steady State Visual Evoked Potentials (SSVEP), and covariance calculation is an important process for these algorithms. Some studies have proved that embedding time-local information into the covariance can optimize the recognition effect of the above algorithms. However, the optimization effect can only be observed from the recognition results and the improvement principle of time-local information cannot be explained. Therefore, we propose a time-local weighted transformation (TT) recognition framework that directly embeds the time-local information into the electroencephalography signal through weighted transformation. The influence mechanism of time-local information on the SSVEP signal can then be observed in the frequency domain. Low-frequency noise is suppressed on the premise of sacrificing part of the SSVEP fundamental frequency energy, the harmonic energy of SSVEP is enhanced at the cost of introducing a small amount of high-frequency noise. The experimental results show that the TT recognition framework can significantly improve the recognition ability of the algorithms and the separability of extracted features. Its enhancement effect is significantly better than the traditional time-local covariance extraction method, which has enormous application potential.


Assuntos
Interfaces Cérebro-Computador , Humanos , Potenciais Evocados Visuais , Reconhecimento Automatizado de Padrão/métodos , Reconhecimento Psicológico , Eletroencefalografia/métodos , Algoritmos , Estimulação Luminosa
17.
Sci Rep ; 14(1): 8527, 2024 04 12.
Artigo em Inglês | MEDLINE | ID: mdl-38609463

RESUMO

Recognising objects is a vital skill on which humans heavily rely to respond quickly and adaptively to their environment. Yet, we lack a full understanding of the role visual information sampling plays in this process, and its relation to the individual's priors. To bridge this gap, the eye-movements of 18 adult participants were recorded during a free-viewing object-recognition task using Dots stimuli1. Participants viewed the stimuli in one of three orders: from most visible to least (Descending), least visible to most (Ascending), or in a randomised order (Random). This dictated the strength of their priors along the experiment. Visibility order influenced the participants' recognition performance and visual exploration. In addition, we found that while orders allowing for stronger priors generally led participants to visually sample more informative locations, this was not the case of Random participants. Indeed, they appeared to behave naïvely, and their use of specific object-related priors was fully impaired, while they maintained the ability to use general, task-related priors to guide their exploration. These findings have important implications for our understanding of perception, which appears to be influenced by complex cognitive processes, even at the basic level of visual sampling during object recognition.


Assuntos
Movimentos Oculares , Percepção Visual , Adulto , Humanos , Reconhecimento Psicológico , Registros
18.
Sci Rep ; 14(1): 8739, 2024 04 16.
Artigo em Inglês | MEDLINE | ID: mdl-38627572

RESUMO

Inspired by recent findings in the visual domain, we investigated whether the stimulus-evoked pupil dilation reflects temporal statistical regularities in sequences of auditory stimuli. We conducted two preregistered pupillometry experiments (experiment 1, n = 30, 21 females; experiment 2, n = 31, 22 females). In both experiments, human participants listened to sequences of spoken vowels in two conditions. In the first condition, the stimuli were presented in a random order and, in the second condition, the same stimuli were presented in a sequence structured in pairs. The second experiment replicated the first experiment with a modified timing and number of stimuli presented and without participants being informed about any sequence structure. The sound-evoked pupil dilation during a subsequent familiarity task indicated that participants learned the auditory vowel pairs of the structured condition. However, pupil diameter during the structured sequence did not differ according to the statistical regularity of the pair structure. This contrasts with similar visual studies, emphasizing the susceptibility of pupil effects during statistically structured sequences to experimental design settings in the auditory domain. In sum, our findings suggest that pupil diameter may serve as an indicator of sound pair familiarity but does not invariably respond to task-irrelevant transition probabilities of auditory sequences.


Assuntos
Pupila , Som , Feminino , Humanos , Pupila/fisiologia , Reconhecimento Psicológico , Percepção Auditiva/fisiologia
19.
PLoS One ; 19(4): e0301839, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38630706

RESUMO

Traditional optical flame detectors (OFDs) in flame detection are susceptible to environmental interference, which will inevitably cause detection errors and miscalculations when confronted with a complex environment. The conventional deep learning-based models can mitigate the interference of complex environments by flame image feature extraction, which significantly improves the precision of flame recognition. However, these models focus on identifying the general profile of the static flame, but neglect to effectively locate the source of the dynamic flame. Therefore, this paper proposes a novel dynamic flame detection method named Dynamic Deformable Adaptive Framework (DDAF) for locating the flame root region dynamically. Specifically, to address limitations in flame feature extraction of existing detection models, the Deformable Convolution Network v2 (DCNv2) is introduced for more flexible adaptation to the deformations and scale variations of target objects. The Context Augmentation Module (CAM) is used to convey flame features into Dynamic Head (DH) to feature extraction from different aspects. Subsequently, the Layer-Adaptive Magnitude-based Pruning (LAMP) where the connection with the smallest LAMP score is pruned sequentially is employed to further enhance the speed of model detection. More importantly, both the coarse- and fine-grained location techniques are designed in the Inductive Modeling (IM) to accurately delineate the flame root region for effective fire control. Additionally, the Temporal Consistency-based Detection (TCD) contributes to improving the robustness of model detection by leveraging the temporal information presented in consecutive frames of a video sequence. Compared with the classical deep learning method, the experimental results on the custom flame dataset demonstrate that the AP0.5 value is improved by 4.4%, while parameters and FLOPs are reduced by 25.3% and 25.9%, respectively. The framework of this research extends applicability to a variety of flame detection scenarios, including industrial safety and combustion process control.


Assuntos
Aprendizado Profundo , Cultura , Reconhecimento Psicológico
20.
PLoS One ; 19(4): e0295301, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38630733

RESUMO

Cross-cultural research has elucidated many important differences between people from Western European and East Asian cultural backgrounds regarding how each group encodes and consolidates the contents of complex visual stimuli. While Western European groups typically demonstrate a perceptual bias towards centralised information, East Asian groups favour a perceptual bias towards background information. However, this research has largely focused on the perception of neutral cues and thus questions remain regarding cultural group differences in both the perception and recognition of negative, emotionally significant cues. The present study therefore compared Western European (n = 42) and East Asian (n = 40) participants on a free-viewing task and a subsequent memory task utilising negative and neutral social cues. Attentional deployment to the centralised versus background components of negative and neutral social cues was indexed via eye-tracking, and memory was assessed with a cued-recognition task two days later. While both groups demonstrated an attentional bias towards the centralised components of the neutral cues, only the Western European group demonstrated this bias in the case of the negative cues. There were no significant differences observed between Western European and East Asian groups in terms of memory accuracy, although the Western European group was unexpectedly less sensitive to the centralised components of the negative cues. These findings suggest that culture modulates low-level attentional deployment to negative information, however not higher-level recognition after a temporal interval. This paper is, to our knowledge, the first to concurrently consider the effect of culture on both attentional outcomes and memory for both negative and neutral cues.


Assuntos
Viés de Atenção , Sinais (Psicologia) , Humanos , Tecnologia de Rastreamento Ocular , Atenção , Reconhecimento Psicológico
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...